Characteristic Gene Selection via Weighting Principal Components by Singular Values
Abstract
Conventional gene selection methods based on principal component analysis (PCA) use only the first principal component (PC) of PCA or sparse PCA to select characteristic genes. These methods implicitly assume that the first PC plays a dominant role in gene selection. However, in a number of cases this assumption does not hold, so the conventional PCA-based methods usually give poor selection results. To improve the performance of PCA-based gene selection, we propose a gene selection method that weights PCs by their singular values (WPCS). Because different PCs differ in importance, the singular values are used as weights that represent the influence of each PC on gene selection. ROC curves and AUC statistics on artificial data show that our method outperforms the state-of-the-art methods. Moreover, experimental results on real gene expression data sets show that our method extracts more characteristic genes responding to abiotic stresses than conventional gene selection methods.
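The weighting idea can be illustrated with a minimal sketch. The abstract does not give the exact WPCS scoring formula, so everything below is an assumption made for illustration: the gene-by-sample layout of `X`, the use of plain (not sparse) PCA via SVD, the normalized singular values as weights, the absolute loadings as per-PC gene importance, and the hypothetical helper names `wpcs_scores` and `select_genes`.

```python
import numpy as np

def wpcs_scores(X, n_components=3):
    """Illustrative gene scores: absolute PC loadings weighted by singular values.

    X is assumed to be a (genes x samples) expression matrix.
    This is a sketch of the weighting idea, not the paper's exact formula.
    """
    # Center each gene across samples so the SVD corresponds to PCA.
    Xc = X - X.mean(axis=1, keepdims=True)
    # Columns of U hold the gene loadings for each PC; s holds the singular values.
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    k = min(n_components, s.size)
    # Assumption: weights are the leading singular values normalized to sum to 1.
    weights = s[:k] / s[:k].sum()
    # Weighted sum of absolute loadings over the first k PCs.
    return np.abs(U[:, :k]) @ weights

def select_genes(X, n_select, n_components=3):
    """Return indices of the n_select highest-scoring genes."""
    scores = wpcs_scores(X, n_components)
    return np.argsort(scores)[::-1][:n_select]

# Synthetic example: 200 genes, 12 samples, first 5 genes carry a strong pattern.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 12))
pattern = np.tile([1.0, -1.0], 6)   # alternating condition effect, zero mean
X[:5] += 10.0 * pattern
scores = wpcs_scores(X)
top = select_genes(X, n_select=5)
```

Setting `n_components=1` reduces the score to the first-PC-only criterion that the abstract describes as the conventional approach, which makes the two easy to compare on the same data.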